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SYSTEM AND METHOD FOR ENHANCING THE SPEED OF DYNAMIC TIMING 
SIMULATION USING DELAY ASSESSMENT AT COMPILE TIME 

FIELD OF THE INVENTION 

Embodiments of the present invention relate to the field 
electronic design automation (EDA) . More particularly, 
embodiments of the present invention relate to the simulation 
of logic circuits, 

PACtCgRQ^ND ART 

The manufacture of integrated circuits at very large 
scale integration (VLSI), or ultra large scale integration 
(ULSI), requires a large investment in time and resources. 
Before committing a circuit design to the manufacturing 
process it is desirable to validate the circuit design 

Validation of a design is the process of examining the 
behavior of the design implementation in light of the 
specification for the design. Validation can be done in a 
variety of ways and includes both static analysis and 
dynamic analysis techniques* There are currently many 
computer aided design (CAD) packages that are used 
specifically for circuit validation, and these packages are 
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generally referred to as electronic design automation (EDA) 
tools. 

In static analysis the circuit is analyzed without input 
data and time dependent behavior is not considered. For 
example, a circuit can be modeled as a resistance-capacitance 
(RC) tree, or sequential elements followed by combinational 
elements, in order to determine the approximate maximum 
signal path delays. Static analysis yields maximum delay 
information and can be used to find gross design violations; 
however, it has the disadvantage of occasionally reporting 
violations for circuit paths that are not used in the 
operation of the circuit because it does not consider the 
behavior of the circuit with actual input signals. 
Conventional static timing also has the disadvantage of 
reporting violations in parts of the circuit that are not of 
interest (e.g., previously tested portions of the circuit). 
Static analysis is also unable to account for data-dependent 
delays or handle certain circuit topologies. 

In dynamic analysis a set of input data is generated and 
the observation of the time dependent behavior of the circuit 
is observed in response to the input. The application of a 
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set of input data and the observation of the response is 
called simulation. In simulation, the design implementation 
is usually expressed in terms of a netlist that is generated 
from a hardware description language such as VERILOG or VHDL. 
Of the EDA tools in use, simulators are the most significant 
in terms of cost and time. 

Due to the complexity of VLSI and ULSI circuits, the 
cost of a fully comprehensive design simulation is 
prohibitive, and there are tradeoffs between accuracy, speed 
and coverage. Thus, considerable effort has been focused on 
finding ways to improve the speed and efficiency of the 
simulation process. 

Simulators spend time in two separate phases: 
compilation and runtime. Normally, runtime is the more 
significant of the two, particularly if the design is being 
analyzed at the gate level rather than the register transfer 
level (RTL) . Since runtime typically requires considerably 
more time than compilation, it is desirable to find methods 
of compiling the circuit for simulation that will reduce the 
overall time required for the runtime phase. 
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SUMMARY OF INVENTION 

Accordingly, embodiments of the present invention provide a 
method for accelerating the runtime phase of a logic circuit 
5 simulation. Embodiments of the present invention improve the 

performance of dynamic simulation by assessing the delays during 
compile time. The improvements in the performance are achieved 
without sacrificing the accuracy of the dynamic simulation, 

A method for reducing the time required for execution of 
p 10 the runtime phase for a logic circuit simulator is 

disclosed. For a logic circuit simulator having a compilation 
phase and a runtime phase e.g., dynamic simulation, a delay 
assessment is performed during the compilation phase in order 
to identify storage elements that are exempt from possible 
15 timing violations at runtime. Runtime timing checks are 
removed from the exempt storage elements, thereby reducing 
the dynamic timing simulation effort. Additionally, 
combinational portions of the circuit that drive the storage 
elements are examined (during backward traversal processes) 
20 for elements that can be effectively eliminated from the 

dynamic timing simulation calculations (by introducing zero 
delay) . The backward traversal process may be made from all 
storage elements, or may be limited to origination from 
exempt (safe) storage elements. The introduction of zero 
25 delays in the combinational portion of the circuit provides a 
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further reduction in the computational overhead during 
dynamic timing simulation, by the use of cycle based 
simulation for these gates. Similarly^ the removal of delays 
from the clock circuits associated with the sequential 
5 elements also provides a significant reduction the 
computation required during simulation. 

In an embodiment of the present invention, a 
combinational portion of a logic circuit is represented as a 
10 network of nodes interconnected by signal paths that have 
associated delays. The inputs and outputs for the 
combinational portion of the logic circuit are connected to 
clocked storage elements, e.g* flip--flops, that are operated 

01 with a particular clock period. The maximum and minimum 

0- 

r- 15 delays at an output flip-flop are determined and compared to 
the maximum and minimum allowable delays at the output flip- 
IP flop. If the maximum delay does not violate the maximum 

allowable delay and the minimum delay does not violate the 
minimum allowable delay, then the output flip-flop is 
20 referred to as "safe" or "exempt". If either the minimum 

allowable delay or the maximum allowable delay is violated, 
the flip-flop is referred to as "unsafe" or "non-exempt". 

Timing checks are removed from storage elements that are 
25 classified as safe. The removal of timing checks during the 
compilation phase provides a first reduction in computational 
effort of dynamic timing simulation. Additionally, the 
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combinational portion of the logic circuit is examined by 
stepping back along the paths leading into the output flip- 
flops and path elements having delays that can be removed are 
identified, A second reduction in computational, effort is 
achieved by removing the identified path element delays from 
the dynamic timing simulation calculations by setting their 
value to zero during the compilation phase, as a result of 
delay assessment information. 

In another embodiment, a major reduction in 
computational effort is achieved by setting the delays 
associated with the clock skew (e.g. delays associated with 
clock buffers) to zero for safe output flip-flops that 
terminate a combinational path. Since clock circuits are 
essentially always active, the removal of delays from the 
simulation of the clock circuits is significant. 

In yet another embodiment, the clock- to~Q delays of 
input flip-flops may be removed. Removal of the clock- to-Q 
delays of input flip-flops is determined by the treatment of 
the gates connected to the input flip-flops. 

In general, for all gates having zero delay, cycle based 
simulation algorithms can be applied during dynamic timing 
simulation in one embodiment. 
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These and other objects and advantages of the present 
invention will no doubt become obvious to those of ordinary 
skill in the art after having read the following detailed 
description of the preferred embodiments which are 
illustrated in the various drawing figures. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1 illustrates a computer system forming a part of 

5 a system in accordance with an embodiment of the present 
claimed invention. 

Figure 2A shows a logic circuit block to which an 
embodiment of the present claimed invention may be applied. 

10 

Figure 2B shows a delay schematic for the combinational 
portion of the logic circuit of Figure 2A. 

Figure 3 shows a table for forward delay sums produced 
15 in accordance with an embodiment of the present claimed 
invention. 

Figure 4 shows a delay data table used for delay 
assessment in accordance with an embodiment of the present 
20 claimed invention. 



Figure 5 shows a table 
produced in accordance with 
claimed invention. 
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for reverse delay differences 
an embodiment of the present 

8 



Figure 6 shows the logic circuit delay schematic of 
Figure 2B with maximum forward delay sums and minimum reverse 
delay differences assigned to nodes in accordance with an 
embodiment of the present invention. 

Figure 7A shows the delay removal associated with an 
aggressive optimization in accordance with an embodiment of 
the present invention. 

Figure 7B shows the delay removal associated with a 
standard optimization in accordance with an embodiment of the 
present invention. 

Figure 8 shows a general flow chart for a method 
embodiment of the present invention. 

Figure 9 shows a general flow chart for the 
determination of forward maximum delay sums and reverse 
minimum delay differences in accordance with a method 
embodiment of the present invention. 
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DESCRIPTION OF THE PREFERRED EMBODIMENTS 



In the following detailed description of the present 
invention, a system and method for enhancing the speed of 
dynamic logic simulation using information obtained from a 
delay assessment; numerous specific details are set forth in 
order to provide a thorough understanding of the present 
invention. However, it will be obvious to one skilled in the 
art that the present invention may be practiced without these 
specific details. In other instances well known methods 
involving schematic representation and the characterization 
of lumped and distributed circuit properties, etc., have not 
been described in detail so as not to unnecessarily obscure 
aspects of the present invention. 

Notation and Nomenclature 

Some portions of the detailed descriptions which follow 
are presented in terms of procedures, logic blocks, 
processing and other symbolic representations of operations 
on data bits within a computer memory. These descriptions and 
representations are the means used by those skilled in the 
data processing arts to most effectively convey the substance 
of their work to „others skilled in the art. A procedure, 
logic block, process, etc., is here, and generally, conceived 
to be a self-consistent sequence of steps or instructions 
leading to a desired result. The steps are those requiring 
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physical manipulations of physical quantities. Usually, 
though not necessarily, these quantities take the form of 
electrical or magnetic signals capable of being stored, 
transferred, combined, compared, and otherwise manipulated in 
a computer system. It has proven convenient at times, 
principally for reasons of common usage, to refer to these 
signals a bits, values, elements, symbols, characters, terms, 
numbers, or the like. 

It should be borne in mind, however, that all of these 
and similar terms are to be associated with the appropriate 
physical quantities and are merely convenient labels applied 
to these quantities. Unless specifically stated otherwise as 
apparent from the following discussions, it is appreciated 
that throughout the disclosure of the present invention, 
terms such as ^'processing" or '"computing" or ''calculating'' 
or "computing" or "determining" or "displaying" or the like, 
refer to the action and processes of a computer system, or 
similar electronic computing device, that manipulates and 
transforms data represented as physical (electronic) 
quantities within the computer system's registers and 
memories into other data similarly represented as physical 
quantities within the computer system's registers or memories 
or other such information storage, transmission or display 
devices . 
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Refer to Figure 1 which illustrates a general purpose 
computer system 112 • In general/ computer systems 112 used by 
an embodiment of the present invention comprise a bus 100 for 
communicating information, a central processor 101 coupled 
with the bus 100 for processing information and instructions, 
a random access memory 102 coupled with the bus 100 for 
storing infonnation and instructions for the central 
processor 101, a read only memory 103 coupled with the bus 
100 for storing static information and instructions for the 
processor 101, a data storage device 104 such as a magnetic 
or optical disk and disk drive coupled with the bus 100 for 
storing information and instructions, an optional display 
device 105 coupled to the bus 100 for displaying information 
to the computer user, an alphanumeric input device 106 
including alphanumeric and function keys coupled to the bus 
100 for communicating user input information and command 
selections to the central processor 101, cursor control 
device 107 coupled to the bus for communicating user input 
information and command selections to the central processor 
101, and a signal generating device 108 coupled to the bus 
100 for communicating command selections to the processor 
101. 

Figure 2A shows an example of a logic circuit block 200 
to which the present invention may be applied. The circuit 
includes a master clock CI, input flip-flops FFIl, FFI2, 
FFI3; output flip-flops FFOl, FF02, FF03; logic gates Gl, G2, 
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G3, G4, G5, G6; buffers Bl, B2, B3, B4, B5, B6, B7. Primary 
inputs PIl, PI2 and PI3 are provided at input flip-flops 
FFIl, FFI2 and FFI3, respectively. Primary outputs POl, P02 
and P03 are provided at output flip-flops FFOl, FF02 and 
FF03, respectively. The primary outputs and inputs may also 
be intermediate input and outputs without a loss of 
generality, the primary inputs may terminate a preceding 
combinational path and the primary outputs may feed a 
succeeding combinational path. Each gate, flip-flop and 
buffer has associated delays that must normally be accounted 
for during dynamic simulation. In a method embodiment of the 
present invention the delays of these circuit elements are 
assessed, and subject to certain criteria, may be removed 
from the dynamic simulation. 

Figure 2B shows a logic circuit delay schematic for the 
combinational portion of the logic circuit of Figure 2A. The 
nodes labeled A, B, and C represent gates Gl, G2, and G3 
respectively. The nodes labeled D, E, and F represent gates 
G4, G5, and G6 respectively. The nodes Af B, C, D, E, and F 
have delays of 5, 5, 10, 5, 10, and 10 nanoseconds, 
respectively. The schematic is intended to show the paths 
traversed during the delay assessment of the present 
invention and as such does not reflect the electrical 
connections of Figure 2A. Each node contains a delay 
associated with its respective gate. In this example, a 
single value is shown; however, the gate delays may be data 
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dependent- Also^ for purposes of this example, the 
interconnect delays are assumed to be zero so that present 
invention may be clearly described by a quantitative example. 
In practice, the interconnect delays are typically provided 
in a standard delay format (SDF) file and accounted for, 

A purely combinational path bounded by an input flip- 
flop FFi and an output flip-flop FFj will by constrained by 
upper and lower bounds with regard to the total permissible 
delay for data from the input flip-flop FFi, The upper bound 
is established by the clock period P minus the setup time TSj 
for the output flip-flop FFj . The sum of the clock-to-Q delay 
Tqi of the input flip-flop FFi and the delay associated with 
the combinational portion must be less than or equal to P - 
TSj. Equivalently, the delay associated with the combinational 
path must be less than or equal to P - Tqi - TSj . This 
constraint is referred to as the setup time constraint. 

The lower bound for the permissible delay associated 
with the input flip-flop FFi the combinational path is 
established by the clock period P and the hold time Thj of the 
output flip-flop FFj. The delay associated with the 
combinational path must be greater than or equal to P - Thj - 
Tqi, This constraint is referred to as the hold time 
constraint. 
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A "safe delay period" that is greater than or equal to 
the hold time constraint and less than or equal to the setup 
time constraint is thus established at each output flip_flop 
FFj for the allowable accumulated delay for an associated 
combinational path. The output flip-flops FFj are 
characterized as either safe or non-safe on the basis of the 
safe delay period, with an output flip-flop being 
characterized as safe if the delays of all associated 
combinational paths fall within the safe delay period of the 
flip-flop. 

The safe delay period may also account for the effects 
of clock skew between the FFi and FFj that bound a 
combinational path. The clock delay at the input flip-flops 
FFi with respect to the master clock is added to the safe 
delay period, and the clock delay at the output flip-flops FFj 
with respect to the master clock is subtracted from the safe 
delay period. 

If an output flip-flop FFj has an associated 
combinational path for which the total delay exceeds the safe 
delay period, it is classified as non-exempt or non-safe. If 
all combinational paths associated with an output flip-flop 
FFj are within the safe delay period, then the FFj is 
classified as safe or exempt. 
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The delays associated with a physical circuit path in a 
logic circuit are generally attributable to a combination of 
lumped and distributed resistances and capacitances 
associated with the devices and their interconnects. Thus 
5 both devices (e.g., a gate or a single transistor) and their 
interconnects may contribute to the total delay in a circuit 
path. Depending upon the desired resolution, the fundamental 
device used in delay assessment can be an individual 
transistor, a gate, or a larger logic block. Generally, a 
10 gate is used as the fundamental circuit element. 

P;^ For logic circuit elements, data dependent delays may 

ft introduce several different alternative values for delays 

that are dependent upon the nature of a transition at a gate. 

5' 

0 15 For example, if there are four logic states available at the 
h^. input and output of a gate, there will be twelve possible 
M transitions, and each transition may have a different delay 
h associated with it. The delay parameters are usually 

available at compile time in a standard delay format (SDF) 
20 file. For simplicity, the present example shows a single 

delay value for all transitions. 

Data dependent delays may also be found cascaded in a 
combinational path. Consecutive data dependent delays may be 
25 combined as necessary for accurate delay assessment. For 
example, a buffer followed by an inverter will have a rise 
delay added to a fall delay, respectively. 
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In a method embodiment of the present invention, the 
total delay associated with each possible forward path at a 
node is determined for each node. For example, nodes A, B, 
and C have forward delays of 5, 5, and 10 nanoseconds. Nodes 
D, E, and F have multiple associated delays since there are 
multiple inputs. 

Figure 3 shows a tabulation of all of the possible 
forward delay sums associated with the paths traversing the 
schematic of Figure 2B. In the forward traverse, the delays 
are sijmmed across each node to provide a forward delay sum at 
the output of the node. For each node, there is a maximum 
forward delay sum and a minimum forward delay sirni. The 
maximum forward delay sum for each node in Figure 3 is 
underlined, and the minimum forward delay sum is circled. 

Figure 4 shows a delay data table used for delay 
assessment in accordance with an embodiment of the present 
claimed invention. Delay data is shown for output nodes G, H 
and I. Nodes G, H, and I are associated with the inputs of 
output flip-flops FFOl, FF02, and FF03 of Figure 2A, 
respectively. For this example, the clock period of the 
master clock CI of Figure 2A is 40 nanoseconds. Tqi, Tgj, and 
Tfij represent the clock-to-Q of the input flip_flop, setup 
time of the output flip_flop, and hold time of the output 
flip-flop, respectively. 
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It should be noted that the scheme shown in Figure 4 is 
only one example of an approach for obtaining a lower bound 
(minimum allowable combinational delay) or hold time 
constraint, and an upper bound (maximum allowable 
combinational delay) or setup time constraint. These upper 
and lower bounds on the delay allowed in the combinational 
portion of the circuit may be derived from other techniques. 

Referring again to Figure A, the minimum forward delay 
sum is compared to the hold time constraint (lower bound) , 
and the maximum forward delay sum is compared to the setup 
time constraint (upper bound) . If the minimum forward delay 
sum is greater (or equal to) than the hold time constraint, 
and the maximiun forward delay svm is less than (or equal to) 
the setup time constraint, the output flip-flop associated 
with the output node is considered safe. This comparison is 
made at each output node, and if either the minim\am or 
maximum constraint is violated, then the node is considered 
unsafe. 

In the example of Figure 4, Output Nodes D (10 less than 
20) and E (15 less than 35) are safe, whereas maximum forward 
delay sum of Output Node F (20 greater than 19) violates the 
setup time constraint, and thus the flip-flop associated with 
Output Node F is unsafe. The safe and unsafe designation is 
shown in Figure 6. Thus, output flip-flops FFOl and FF02 of 
Figure 2A would be safe, and output flip-flop FF03 of Figure 
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2A would be unsafe. The timing checks that are conventionally 
in place during dynamic simulation are removed during compile 
time from the nodes that are found to be safe. 

In addition to removing the timing checks on output 
flip-flops that are safe, the delays associated with the 
clock buffers that drive the safe flip-flops are removed as 
well. Thus, the delays that are associated with buffers B2 
and B4 of Figure 2A would be removed during dynamic 
simulation. 

Figure 5 shows a table for reverse delay differences 
produced in accordance with an embodiment of the present 
claimed invention. Referring again to Figure 2B, a series of 
reverse delay differences is obtained by a backward traversal 
from each output node along each possible combinational path. 
Starting from the output node, the delay associated with each 
node is subtracted from the setup time constraint (upper 
bound) and assigned to the input side of the node as a 
reverse delay difference, for example, the reverse delay 
difference at node D is equal to: 26 (setup time constraint 
for node G from Figure 4) - 5 (delay associated with node D) 
= 21. Similarly, the reverse delay difference at node A from 
path DA is equal to 26 - 5 - 5 = 16. The minimum reverse 
delay difference for each node is underlined in Figure 5. 

Figure 6 shows the logic schematic of Figure 2B with the 
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addition of a comparison of forward maximum delay sums and 
minimum reverse delay differences. The maximum forward delay 
sums are shown in circles with an arrow pointing to the 
right, and the minimum reverse delay differences are shown 
with an arrow pointing to the left. At a given node, the 
maximum delay sum represents the worst case (greatest) delay 
sum for all paths leading into that node. Similarly, the 
minimum reverse delay difference represents the amount of 
delay that can be tolerated in any path entering the node, 
such that total delay does not violate the setup time 
constraint (upper bound) . 

If the minimum reverse delay difference at a node is 
greater than the forward maximum delay difference at that 
node, then all adjacent nodes on paths leading forward out of 
the node can have their associated delays set to zero during 
compile time without compromising the accuracy of the dynamic 
simulation. Forward delay sums and reverse delay differences 
may include the delays associated with interconnects. For 
example, the delay sums shown in figure 3 for node A would be 
incremented by some amount when compared to the reverse delay 
difference at node D or node E. Similarly, the reverse delay 
differences would be decremented by some amount. 

The reduction to zero of delays can also be extended to 
the clock-to-Q delay associated with an input flip-flip such 
as FFIl, FFI2, and FFI3 of Figure 2A. An input flip-flop 
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having one or more gates connected to its output may have its 
clock-to-Q delay reduced to zero during compile time if all 
of the connected gates have had their delays reduced to zero, 
The elimination of the clock-to-Q delay at qualified input 
flip-flops provides a further improvement in dynamic 
simulation. The reduction to zero of delays associated with 
gates and flip-flops will also be influenced by whether a 
standard or aggressive optimization is used. 

Figure 7A shows a ''reduced" logic circuit schematic of 
Figure 6, with selected delays set to zero and selected 
timing checks removed. Each node that meets the criterion of 
having a reverse minimum delay difference that is greater 
than the forward maximum delay sum is set to zero delay. The 
reduction shown in Figure 7A is referred to as an aggressive 
optimization. Only Nodes C and F retain non-zero delay 
values, whereas the remainder are set to zero for purposes of 
dynamic simulation. With respect to the application of 
aggressive optimization to Figure 2A, the input flip-flops 
FFIl and FFI2 would have their clock-to-Q delays removed for 
dynamic simulation, whereas FFI3 would not. 

Figure 7B shows a '"reduced" logic circuit schematic 
similar to that of Figure 7A, with a standard optimization 
used in place of an aggressive optimization. Standard 
optimization has the constraint that nodes on combinational 
paths connected to an unsafe node are not eligible for 
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reduction to zero. Thus, standard optimization has three 
nodes that have not been reduced to zero (nodes C, and F) . 
With respect to the application of standard optimization to 
Figure 2A, input flip-flop FFIl would have its clock-to-Q 
delay removed for dynamic simulation, whereas FFI2 and FFI3 
would not. 

Since timing checks are not removed from unsafe nodes 
due to the potential for timing violations, the standard 
optimization prevents the introduction of ''zeroed" delays in 
paths leading to unsafe nodes. The introduction of a zeroed 
path element may in remote cases distort the timing 
information reported at an unsafe node. 

It should be noted that the ''greater than" criterion 
described above could also be expressed as a greater than or 
equal" criterion by appropriate adjustment of the safe delay 
period values or other parameters. 

Figure 8 shows flow chart 800 for the general overall 
process for a method embodiment of the present invention. A 
netlist 805 is used as input for both a delay assessment step 
810 and for a dynamic timing simulation step 820. 

In step 810, a delay assessment is perfomed using the 
delay data provide by the netlist 805. The delay data may 
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include delays for individual gates or transistors, and may 
also include delays for interconnect wires between devices. 
The delay data is typically provided in a standard delay 
format (SDF) file. 

In step 815, timing checks are removed at safe storage 
elements, and path elements for which delays can be set to 
zero are identified. Additionally, the clock buffers driving 
safe output storage elements may be zeroed, and the clock~to- 
Q delays of qualified input storage elements (flip-flops) may 
be zeroed. 

In step 820, the data from the netlist 805 and the 
revised timing checks and delays from step 815 are combined 
to perform the dynamic timing simulation with a reduced 
overhead to give the result 825. 

Figure 9 shows flow chart 900 for a method embodiment of 
the present invention. The series of steps shown describe the 
actions performed during compilation of a logic circuit 
simulation. It should be noted that the steps shown are not 
required to be performed in the sequence shown. 

In step 905, a combinational portion of the circuit is 
defined. The combinational circuit portion will typically be 
represented as a network of input nodes, interior nodes and 
output nodes that are interconnected. The delays associated 
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with the path elements correspond to the delays found in the 
interconnects and devices (e.g., gates) of the circuit being 
simulated • Logic signals are propagated from the input nodes 
of the circuit through the network to the output nodes. The 
output nodes are typically associated with storage elements, 
for example^ flip-flops that may share a common clock domain 
with the combinational circuit portion being simulated. 

In step 910, the forward delay sums for each node are 
determined. At the input nodes the delay sum is nominally 
zero. The reference point for delay assessment at the input 
may be set before or after an input storage element. For each 
interior node and each output node the delay along each path 
entering the node is found by adding the delays associated 
with the path elements that make up the path. The largest 
value for the forward delay sums at a node is designated as 
the maximum forward delay sum for that node. Examples of path 
elements are gates and interconnects. 

In one approach to determining the maximum forward delay 
sum at a given node, all possible delay sums are determined 
and the largest value is selected, with the process being 
repeated for each node. In an alternative approach, the 
maximum forward delay sum is determined node by node, using 
the forward maximum delay sum value from a connected node on 
as the basis for the sum at the next node, thus avoiding the 
repetition of calculating intermediate sums. 
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In step 915, the safe delay period at each output node 
is determined. For example, the safe delay period may have 
the hold time constraint as a lower bound and the setup time 

5 constraint as an upper bound. Based upon the safe delay 

period, the output nodes are classified as safe or unsafe. If 
all of the combinational paths leading in to an output node 
are within the safe period, the node is safe. If an output 
node is connected to a path that violates either the hold 

10 time constraint (lower bound) or the setup time constraint 
(upper bound), the node is unsafe. 

In step 920, the timing checks are removed from the safe 
storage elements. A storage element associated with an output 
15 node is considered safe if the maximum forward delay sum at 
an output node is less than the safe delay period for that 
node. The removal of timing checks provides a reduction in 



W the computational overhead. The removal of the timing checks 

may also include the setting to zero of clock buffers driving 
20 the safe output storage elements, while retaining the delays 
of clock buffers driving unsafe output storage elements. 



In step 925, the minimum reverse delay difference for 
each node is determined. Beginning at each output storage 
25 element, a delay difference is found for each connected node 
by subtracting the delay associated with the node from the 
upper bound (setup time constraint) . In aggressive 
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optimization, the backward traversal is initiated at output 
storage elements, whereas in the standard optimization the 
backward traversal is not performed on paths connected to 
unsafe output storage elements. One or more delay differences 
may be calculated at a given node, and the lowest value is 
designated as the minimum reverse delay difference. At each 
node in a given path, the reverse delay difference may be 
determined with respect to each adjacent connected node by 
using the minimum delay difference at the adjacent connected 
node. A reverse delay difference at a given node is found by 
subtracting delay at the node from the reverse delay 
differences of the preceding connected nodes. Interconnect 
delays may also be subtracted as appropriate. 

In step 930, the maximum forward delay sum and the 
minimum reverse delay difference at each node other than the 
output nodes is compared. Delays at nodes having a reverse 
minimum delay difference that is greater than the forward 
maximum delay sum can be ignored, and are set to zero for 
compilation purposes. This elimination of combinational 
delays further decreases the runtime phase computational 
overhead. In addition, those input storage elements for which 
all connected combinational elements have been reduced to 
zero are qualified (for delay reduction to zero) , and their 
associated clock-to-Q delays are set to zero. If an input 
storage element is connected to gate that has not had its 
delay zeroed, the clock-to-Q delay remains. 
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In step 935, a dynamic simulation is performed for the 
netlist. Cycle based simulation may be used for those storage 
elements for which timing checks have been removed, and path 
element delays for which delays have been set to zero are 
effectively ignored. 

The foregoing descriptions of specific embodiments of 
the present invention have been presented for purposes of 
illustration and description. They are not intended to be 
exhaustive or to limit the invention to the precise forms 
disclosed, and obviously many modifications and variations 
are possible in light of the above teaching. The embodiments 
were chosen and described in order to best explain the 
principles of the invention and its practical application, to 
thereby enable others skilled in the art to best utilize the 
invention and various embodiments with various modifications 
are suited to the particular use contemplated. It is 
intended that the scope of the invention be defined by the 
Claims appended hereto and their equivalents. 
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